Automatic accent classification using ensemble methods

Authors

  • Fukun Bi
  • Jian Yang
  • Dan Xu
Abstract

Accent classification directly affects the performance of state-of-the-art speech recognition systems. In this paper, we propose a novel accent classification scheme that uses the decision-templates (DT) ensemble algorithm to combine base classifiers built on acoustic feature subsets. Using different feature subsets provides sufficient diversity among the base classifiers, which is known to be a necessary condition for improving ensemble performance. Compared with Majority Voting ensembles and Support Vector Machines, our ensemble scheme achieves the highest performance. We also use diversity analysis to investigate why ensemble systems can improve performance. Our experiments use a native Mandarin speech corpus and a non-native multi-accent Mandarin speech corpus that contains the accents of three typical minorities in Yunnan, China.
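
The decision-templates combination rule named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each base classifier already outputs soft class scores, that every class has at least one training sample, and that squared Euclidean distance is the similarity measure. The function names and array layout are hypothetical.

```python
import numpy as np


def fit_decision_templates(profiles, labels, n_classes):
    """Average the decision profiles of each class into one template per class.

    profiles: array of shape (n_samples, n_classifiers, n_classes) holding the
              soft outputs of every base classifier for every training sample.
    labels:   integer class labels of shape (n_samples,).
    Assumes every class in range(n_classes) has at least one training sample.
    """
    profiles = np.asarray(profiles, dtype=float)
    labels = np.asarray(labels)
    return np.stack(
        [profiles[labels == c].mean(axis=0) for c in range(n_classes)]
    )


def predict_decision_templates(templates, profiles):
    """Assign each sample to the class whose template is closest to its profile."""
    profiles = np.asarray(profiles, dtype=float)
    # Broadcast to (n_samples, n_classes, n_classifiers, n_classes).
    diff = profiles[:, None, :, :] - templates[None, :, :, :]
    # Squared Euclidean distance between each profile and each template
    # (an assumed similarity measure; other measures are possible).
    dist = (diff ** 2).sum(axis=(2, 3))
    return dist.argmin(axis=1)
```

In this sketch, each base classifier would be trained on its own acoustic feature subset, and its soft outputs would form one row of a sample's decision profile.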

Similar resources

Fault Detection of Anti-friction Bearing using Ensemble Machine Learning Methods

Anti-Friction Bearing (AFB) is a very important machine component, and its unscheduled failure causes malfunctions in a wide range of rotating machinery, resulting in unexpected downtime and economic loss. In this paper, ensemble machine learning techniques are demonstrated for the detection of different AFB faults. Initially, statistical features were extracted from temporal vibratio...

Optimum Ensemble Classification for Fully Polarimetric SAR Data Using Global-Local Classification Approach

In this paper, a proposed ensemble classification for fully polarimetric synthetic aperture radar (PolSAR) data using a global-local classification approach is presented. In the first step, to perform the global classification, the training feature space is divided into a specified number of clusters. In the next step to carry out the local classification over each of these clusters, which cont...

Automatic Accent Recognition Systems and the Effects of Data on Performance

This paper considers automatic accent recognition system performance in relation to the specific nature of the accent data. This is of relevance to the forensic application, where an accent recogniser may have a place in casework involving various accent classification tasks with different challenges attached. The study presented here is composed of two main parts. Firstly, it examines the perf...

Validation of Synoptic Station Data Using Ensemble Classification on Central Iran

Today, the use of data recorded in synoptic stations of the country is one of the most significant sources of applied research for researchers. Data recorded automatically or manually at synoptic, climatological, and other stations are analyzed for statistical analysis. In this research, the data recorded in the synoptic stations of Iran, which are used to determine the days of dust, were analy...

Assessing the efficacy of benchmarks for automatic speech accent recognition

Speech accents can possess valuable information about the speaker that can be used in intelligent multimedia-based human-computer interfaces. The performance of algorithms for automatic classification of accents is often evaluated using audio datasets that include recording samples of different people, representing different accents. Here we describe a method that can detect bias in accent data...


Publication date: 2008